CDS

Accession Number TCMCG075C14707
gbkey CDS
Protein Id XP_007035525.2
Location join(30424433..30424540,30425213..30425311,30425534..30425596,30425763..30425840,30426268..30426457,30426682..30426941,30427024..30427402,30427864..30427941,30428928..30429148,30429288..30429350,30430119..30430442,30430529..30430707,30430785..30431067,30431339..30431575,30431769..30431930,30432345..30432488,30432583..30432801,30433159..30433458,30433575..30433739)
Gene LOC18603464
GeneID 18603464
Organism Theobroma cacao
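
The Location field above uses the GenBank-style join() operator, listing 19 exon segments as 1-based inclusive coordinate pairs. As a minimal sketch, the spliced CDS length can be recomputed from those segments and compared with the protein length reported in this record (1183 aa plus one stop codon). The names parse_join and spliced_length are hypothetical helpers introduced here for illustration. They are not part of any database tool, and the parser assumes a simple forward-strand join() with no complement() or partial-end markers.

```python
import re

# Location string copied verbatim from the record's Location field.
LOCATION = (
    "join(30424433..30424540,30425213..30425311,30425534..30425596,"
    "30425763..30425840,30426268..30426457,30426682..30426941,"
    "30427024..30427402,30427864..30427941,30428928..30429148,"
    "30429288..30429350,30430119..30430442,30430529..30430707,"
    "30430785..30431067,30431339..30431575,30431769..30431930,"
    "30432345..30432488,30432583..30432801,30433159..30433458,"
    "30433575..30433739)"
)


def parse_join(location):
    """Return a list of (start, end) pairs, 1-based inclusive.

    Hypothetical helper: handles only plain 'start..end' segments,
    which is all this record's Location field contains.
    """
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]


def spliced_length(segments):
    """Sum segment lengths; both endpoints are inclusive, hence the +1."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_join(LOCATION)
cds_length = spliced_length(segments)

# The record reports a 1183 aa protein; a complete CDS adds one stop codon.
protein_length = 1183
expected_cds_length = 3 * (protein_length + 1)

print(len(segments), cds_length, expected_cds_length)
```

If the two lengths agree, the exon coordinates are consistent with the reported protein length. A mismatch would suggest a partial CDS or a transcription error in the coordinates.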

Protein

Length 1183 aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_007035463.2
Definition PREDICTED: protein ALWAYS EARLY 3 isoform X3 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category K
Description Protein ALWAYS EARLY
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001
KEGG_ko ko:K21773
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGGCGCCATCTAGAAAATCTAAAAGTGTAAATAAGAAGTTTTCTTATGTTAATGAGGTTGCTTCTAGTAAAGATGGAGATAGTAGTGCTAAGAGAAGCGGGCAACGGAAAAGGAAGTTGTCTGACATGTTAGGGCCTCAATGGACTAAGGAAGAGCTTGAGCGTTTCTATGAAGCGTATCGCAAGTATGGGAAAGATTGGAAGAAGGTTGCTACTGTGGTACGAAATCGATCTGTGGAAATGGTAGAAGCTCTGTACACTATGAATAGGGCCTACTTATCTCTCCCGGAAGGCACTGCTTCTGTGGTTGGACTCATAGCGATGATGACTGATCACTATTGTGTTATGGGAGGAAGTGATAGTGAACAAGAAAGCAATGAGGGCGTGGGAGCTTCTCGGAAACCTCAGAAGCGTAGTAGGGGAAAACTTCGAGATCAACCCTCTAAAAGTTTAGATAAGTCATTTCCTGATCTTTTGCAATTTCATTCAGCTGCATCAAGTTATGGTTGCTTGTCATTGTTGAAGAGGAGACGCTCTGAAAGTAGGCCCCGTGCTGTTGGAAAAAGGACTCCTCGTGTTCCTATTTCTTTTTCTCATGACAAAAACAAAGGAGAAAGGTACTTTTCACCTATTAGGCAGGGCATGAAACTAAAGGTGGATACCGTTGATGATGATGTTGCTCATGAGATAGCATTAGTTTTGACGGAGGCATCACAAAGAGGTGGATCTCCTCAAGTTTCTCGAACACCAAACAGAAAAGCAGAGGCATCTTCACCTATTCTCAACAGTGAAAGGATGAATGCTGAGTCAGAAACTACTAGTGCCAAGATTCATGGTAGTGAAATGGATGAGGATGCTTGTGAATTGAGCTTAGGAAGCACTGAAGCTGATAATGCTGATTATGCTAGAGGTAAAAATTATTCAATGAATATAGAAGGGACTGGTACCATTGAAGTTCAACAGAAGGGAAAAAGATACTACAGAAGGAAGCCAGGGGTTGAGGAAAGTGTAAACAATCATCTGGAAGACACAAAAGAAGCCTGTAGTGGGACGGAAGAAGATCAAAAGTTATGTGATTTCAAGGGAAAGTTTGAAGCAGAGGTTGCAGATACCAAACCTTCTAGAGGCTCCATCAAGGGTCTAAGGAAAAGAAGTAAAAAAGTGTTGTTTGGGAGAGTTGAAGACACTTCCTTTGATGCCCTGCAAACTCTAGCAGATCTGTCCTTGATGATGCCAGAAACTGCTGCTGATACTGAGTCATCTGTGCAGTTCAAGGAAGAGAAAAATGAAGTTGTTGAGAAGACTAAACTGAAAGGAAACCATCCTGTTTCTGGAGCTAAAGGCACTGCCCCCAAAACATGTAAACAGGGAAAAGTTTTTGGTCATGATGTTCGTGCTATTCCCGAGGCAAAGGAGGAAACACACCCAGGTAATGTTGGAATGCGGAAAAGGAGACAGAAGTCCTCACCATATAAATTGCAGATTCCAAAAGATGAAACTGATGCTGATTCTCATTTGGGTGAATCTCGAAACATTGAGGCTTTAGATGAGGTAAAGAATTTTCCAAGCAAAGGTAAACGCTCTAATAATGTTGCACATTCAAAGCAAGGGAAATCAGTGAGACCTCCAGAGCATCGTTCCTCAAGTACTGATCATGGAAGGGACTTGAACAATTCAGCTCCATCTACCATACAGGTTTCACCTGTTAACCAGGTCAACCTACCCACAAAAGTCAGGAGTAAGAGAAAGATAGATGCACAGAAACAAGTGATTGGGAAGGATATAAAGTCCTCTGATGGTATTGTGAAGGGAAAATTTAGTGTTCCAGTTAGTTTATTCCATGACAGAGCACTCAATCTGAAGGAAAAGCTTTGTAACTTCCTATGTCCATATCAAGCACGGAGATGGTGTACCTTTGAGTGGTTCTGTAGTACAATTGATTATCCATGGTTTGCTAAAAGGGAGTTTGTGGAGTATTTGGATCATGTAGGATTGGGTCA
TGTTCCAAGATTAACTCGTGTTGAATGGGGTGTCATAAGGAGTTCCCTTGGCAAGCCACGAAGGTTTTCTGAGCAATTTTTGAAGGAAGAAAGAGAGAAGCTTTATCAATATCGGGAATCTGTTAGAACGCATTATGCTGAACTCCGTGCTGGTATTGGTGAAGGACTTCCAACTGATTTAGCTCGACCTCTATCAGTTGGACAGCGTGTTATTGCTATTCATCCAAAAACTAGAGAGATTCATGATGGAAATGTGTTAATTGTTGACCATAGTAGGTACCGGATTCAATTTGACAGCACTGAGCTAGGAGTGGAATCTGTCATGGATATTGATTGTATGGCTTTAAATCCATTGGAAAATTTGCCTGCTTCCCTTGTGAGACAAAATGCTGCTGTCAGGAAATTTTTTGAGAACTACAATGAGCTCAAAATGAACGGGCAGCCAAAAGAAAGCAAGATGGAAGAGAACATCAAATTTGCTTCGTGTGAGGAGAATGCCAATAGTCCCTCTCGAACTTCCCCATCAACTTTCAGTGTTGGCAATTTATCACAACTTGTTAAGGTTGATCCATCAAGTCCTAATTTACAACTTAAAGTTGGGCCTATGGAAACTGTTTATACTCAGCAGGCAGTAAATTCCCAGCCTTCTGCTCTGGCGCTGATACAGGCGAGGGAAGCTGATGTTGAAGCTCTTTCTCAGTTGACTCGTGCTCTTGACAAAAAGGAGGCTGTGGTCTCTGAACTACGGCGTATGAATGATGAGGTGTTGGAAAACCAGAAAGGTGGGGACAACTCTATAAAGGATTCAGATTCTTTCAAGAAGCAATATGCTGCTGTTCTTTTACAGTTAAATGAAGTCAATGAGCAGGTTTCTTCTGCTCTCTTTTCCTTGAGGCAACGCAATACATATCAAGGGACCTCCTCAGTTAGATTGCTGAAGCCCTTGGCTAAAATTGGTGAGCATGGTTGTCAGTTGAGCTCTTTTGATCATTCTATGCATCATGCCCAAGAATCTGTATCCCATGTGGCTGAAATTGTTGAAAGTTCAAGAACGAAAGCTCGGTCAATGGTGGATGCAGCTATGCAGGCTATGTCATCCTTGAGAAAAGGGGGGAAAAGCATCGAGAGGATTGAGGACGCAATAGATTTTGTAAATAACCAGCTTTCGGTGGATGATCTTAGTGTGCCTGCTCCGCGGTCTTCTATCCCAATAGACTCAGCCCACAGTACGGTAACTTTTCACGATCATCTCACTGCCTTTGTGTCAAATCCACTGGCAACTGGTCATGCACCTGATACAAAGTTGCAAAATTCGTCTGACCAAGACGATCTTAGAATCCCTTCAGACCTTATCGTGCATTGTGTAGCCACCTTGCTCATGATTCAGAAGTGTACAGAAAGGCAGTTTCCACCTGGAGATGTTGCCCAGGTACTAGATTCTGCTGTTACTAGTTTGAAGCCGTGTTGTTCACAAAATCTCTCAATTTATGCAGAGATACAGAAATGTATGGGAATTATTAGGAACCAGATATTGGCGCTGGTACCTACATAG
Protein:  
MAPSRKSKSVNKKFSYVNEVASSKDGDSSAKRSGQRKRKLSDMLGPQWTKEELERFYEAYRKYGKDWKKVATVVRNRSVEMVEALYTMNRAYLSLPEGTASVVGLIAMMTDHYCVMGGSDSEQESNEGVGASRKPQKRSRGKLRDQPSKSLDKSFPDLLQFHSAASSYGCLSLLKRRRSESRPRAVGKRTPRVPISFSHDKNKGERYFSPIRQGMKLKVDTVDDDVAHEIALVLTEASQRGGSPQVSRTPNRKAEASSPILNSERMNAESETTSAKIHGSEMDEDACELSLGSTEADNADYARGKNYSMNIEGTGTIEVQQKGKRYYRRKPGVEESVNNHLEDTKEACSGTEEDQKLCDFKGKFEAEVADTKPSRGSIKGLRKRSKKVLFGRVEDTSFDALQTLADLSLMMPETAADTESSVQFKEEKNEVVEKTKLKGNHPVSGAKGTAPKTCKQGKVFGHDVRAIPEAKEETHPGNVGMRKRRQKSSPYKLQIPKDETDADSHLGESRNIEALDEVKNFPSKGKRSNNVAHSKQGKSVRPPEHRSSSTDHGRDLNNSAPSTIQVSPVNQVNLPTKVRSKRKIDAQKQVIGKDIKSSDGIVKGKFSVPVSLFHDRALNLKEKLCNFLCPYQARRWCTFEWFCSTIDYPWFAKREFVEYLDHVGLGHVPRLTRVEWGVIRSSLGKPRRFSEQFLKEEREKLYQYRESVRTHYAELRAGIGEGLPTDLARPLSVGQRVIAIHPKTREIHDGNVLIVDHSRYRIQFDSTELGVESVMDIDCMALNPLENLPASLVRQNAAVRKFFENYNELKMNGQPKESKMEENIKFASCEENANSPSRTSPSTFSVGNLSQLVKVDPSSPNLQLKVGPMETVYTQQAVNSQPSALALIQAREADVEALSQLTRALDKKEAVVSELRRMNDEVLENQKGGDNSIKDSDSFKKQYAAVLLQLNEVNEQVSSALFSLRQRNTYQGTSSVRLLKPLAKIGEHGCQLSSFDHSMHHAQESVSHVAEIVESSRTKARSMVDAAMQAMSSLRKGGKSIERIEDAIDFVNNQLSVDDLSVPAPRSSIPIDSAHSTVTFHDHLTAFVSNPLATGHAPDTKLQNSSDQDDLRIPSDLIVHCVATLLMIQKCTERQFPPGDVAQVLDSAVTSLKPCCSQNLSIYAEIQKCMGIIRNQILALVPT